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AUSTRACT 


A  stabilizing  control  design  for  general  linear  time  vary¬ 
ing  systems  is  presented  and  analyzed.  The  control  is  a 
state-feedback  law  with  gains  determined  by  a  standard 
method  employed  in  optimal  regulator  problems.  The  con¬ 
sidered  cost  function  is.  however,  dynamically  redefined 
over  a  fixed  depth  horizon.  Hie  method  is  shown  to  yield  a 
stable  closed  loop  system  and  computationally  efficient 
recursions  tor  the  feedback  gain  are  provided. 


1.  INTRODUCTION 

Several  approaches  exist  for  the  design  of  stabilizing 
control  laws  for  linear  time-invariant  systems.  Along  with 
the  classical  frequency-domain  techniques,  the  "modern" 
performance  index  optimization  methods  further 
guaranteed  robust,  stabilizing  feedback  controls  [l].  The 
situation  is  quite  different,  however,  for  time-varying  linear 
systems.  While  an  ''optimal",  stabilizing  state-feedback  gain 
for  a  time-invariant  system  can  be  found  by  solving  an  alge¬ 
braic  Riccati  equation,  the  analogous  solution  for  the  time- 
varying  case  requires  the  backwards  iteration  of  a  matrix 
differential  equation  over  an  infinite  horizon  [2],[3].  Obvi¬ 
ously.  this  is  not  a  practical  way  to  obtain  a  stabilizing  con¬ 
trol.  The  problem  arises  from  the  fact  that  the  con  putation 
of  the  gain  at  every  instant  of  time  requires,  in  principle, 
the  optimization  of  a  performance  index  over  an  infinite 
time  span  into  the  future.  A  natural  way  to  try  to  overcome 
the  computational  difficulty  is  to  assume  that  at  ail 
moments  f  we  have  to  find  the  optimal  control  for  a  fixed, 
finite  horizon  of  depth  T  [4], [5]. 

The  standard  regulator  problem  poses  the  question  of 
determining  the  optimal  control  u*(.)  to  be  applied  to  a 
linear  system  in  order  to  minimize  a  cost  functional  over  a 
given  interval  (f«.f/).  In  the  case  of  a  quadratic  cost  the 
resulting  control  is  a  simple  suite-feedback  law,  the  gain 
computation  involving  the  well-known  backwards  Riccati 
equation  [l].  The  control  applied  at  time  f.  given  a  ''sliding*' 
horizon  of  fixed  depth  T,  would  therefore  be  the  initial  step 
in  minimizing  a  quadratic  performance  index  over  (f  .1  +  T). 
It  is  also  immediate  that  this  procedure  leads  to  a  state- 
feedback  control  law,  the  gain  being  computed  through  a 
Riccati  recursion  starting  at  f  +  T  and  proceeding  backwards 
tof  [41. 

Although  very  reasonable  in  concept,  this  receding  hor¬ 
izon  procedure  docs  not  have  any  obvious  interpretation  in 
terms  of  optimizing  u  performance  index  over  some 
predetermined  interval  (14.1/):  it s  value  lies  instead  in  the 
fact  that  it  yields  a  practical  and  computationally  eflicicnt 
technique  for  stabilizing  general  litnc-varying  linear  sys¬ 
tems.  Also,  it  is  worth  mentioning  that  this  is  a  proper  gen¬ 
eralization  of  Kloinman's  "easy"  way  to  stabilize  a  time- 


invariant  systems  through  state-feedback  [6]. 

Since  computing  the  feedback  gain  for  the  above 
defined  control  law  requires,  in  principle,  completing  a 
backwards  recursion  over  an  interval  of  length  T  for  all  t  it 
is  not  obvious  that  this  procedure  can  be  rendered  compu¬ 
tationally  feasible.  In  the  sequel  wc  shall  show  that  in  fact 
one  can  derive  a  recursive  algorithm  that  updates  the  con¬ 
trol  gain  directly,  avoiding  the  necessity  to  solve  a  back¬ 
wards  Riccati  equation  over  and  over  again.  This  algorithm 
is  derived  through  a  convenient  embedding  of  the  feedback 
gain  in  a  suitably  defined  scattering  matrix  and  using  some 
well-known  results  of  Redheflcr  scattering  theory  [7].  We 
shall  then  address  the  problem  of  system  stability,  estab¬ 
lishing  that,  under  certain  uniform  controllability  condi¬ 
tions  and  some  conditions  on  the  moving  interval  cost  func¬ 
tion.  the  closed  loop  system  becomes  asymptotically  stable. 

A  suboptimal  state  estimator,  the  structural  dual  sys¬ 
tem  of  the  receding  horizon  controller,  is  introduced  and 
briefly  discussed  in  the  last  section  of  the  paper. 

Z.  MOVING  HORIZON  CONTROI.  LAWS 

Consider  the  time-varying  linear  system  described  by: 

*{t )  =  4x(O+0|«(O  (ZD 

where  x(.)el?"  andu(.)ef?m.  Let  /  be  a  standard  quadratic 
cost  functional  over  a  fixed  interval  (f«  f/)  defined  as 

•/ 

J  =  /  1*'(t)9t*(z)  +  u'(r)/?Tiz(r)]dr  +  x'{tf)Ftx(lf)  (2.2) 

•t 

Here  Qt .  Rt  and  Ft  are  known,  positive-definite,  symmetric 
matrices,  essentially  design  parameters. 

It  is  well-known,  [l],  that  the  optimal  control  input  that 
minimizes  J  is  provided  by  the  following  state-feedback  law 

«•(«)  =  -Rj  ,t/;Fl/)x(t )  (2.3) 

The  gain  K(t,tt  :Ftf)  is  computed  through  the  backwards 
Riccati  recursion 

=  K(T.tf:Fl/)Ar*A\K(r.t/;Flj) 
-K(T.t/:Ftt)BTBt-'B-TK(T,t/;Ft/)*Qr  (2  4) 
with  Anal  condition  AT(f/,f/;/|/)  =  Ft/ 

The  modified,  receding  horizon  control  un(f)  is  defined 
as  the  input  at  lime  t  that  would  be  needed  to  minimize, 
over  (f.l  +  r).  the  following  criterion 
•  ♦r 

Jm  -  f  1*'(t)9t*(t)  ♦  u'(r)AfTu (T))dr 


♦  *'«  +  r)/ifr*(f  +  r)  (2.5) 
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Therefore  this  control  law  is  also  a  stale-feedback  law,  given 
by: 

u"(f)  =  -Rr'rftK(t.t  +  T;F,*r)*(i)  <z-6) 

the  gain  K(t.t  +  T;F,* r)  being  obtained  from  the  backwards 
nil  equation  (2.4 )  with  1/  replaced  by  t  +  T. 


The  interpretation  of  this  modified  control  is  that,  at 
each  moment  t.  the  input  applied  is  chosen  as  if  the  optimi* 
ration  of  the  criterion  Jn  over  ( t.t  +  T )  was  the  overall 
objective.  It  is  readily  seen,  though,  that  this  control  does 
not  minimize  any  obvious  overall  cost  function  of  the  type 
(2.2)  on  any  given  interval  (<*.f/ ). 

In  principle,  in  order  to  compute  /f(f.f +  r:A|*r)  one 
might  solve  at  all  moments  in  lime  the  backwards  Riccati 
equation,  with  final  condition  al  l*  T  given  by  Fur-  This  is 
not  a  computationally  feasible  approach.  However,  one 
immediately  realizes  that  in  the  case  of  time-invariant  sys¬ 
tems  and  fixed  weighting  matrices  Q ,  R  and  F.  the  moving 
horizon  method  yields  a  constant  feedback  gain.  The  control 
law,  in  this  case,  is  simply  given  by 

«•*(!)  =  - R-'BKTx(t )  (2.7) 

where  Kj  can  be  obtained  from 

4rK,  =  KtA  +  AK,  -  K,BR~lBKt  +  Q  \  fC6  =  F  (2.8) 
at 

The  particular  case  of  Q  =  0  andF -»*»  provides  Klcinman's 
method  for  stabilization  of  a  time-invariant  system  [6]. 

The  general  case  with  /',=«>  turns  out  to  be  very 
important  in  providing  a  stabilizing  moving  horizon  control 
for  time-varying  systems.  The  infinite  weight  assigned  to  the 
final  state  implies  that  the  optimal  control  is  required  to 
take  the  state  to  0  at  the  Anal  time,  while  minimizing  a  qua¬ 
dratic  cost  over  the  given  interval  [4].  In  this  case  the  back¬ 
wards  Riccati  equation  has  infinity  as  initial  condition,  a 
somewhat  ambiguous  starting  point  for  a  recursion.  There¬ 
fore  the  following  well-known  trick  is  invoked:  instead  of 
considering  the  differential  equation  for  K(t  ,f/:“)  derive  a 
recursion  for  P(t.t /)  -  •,*>).  It  is  quite  simple  to 

show  that  the  result  is  the  following  Riccati  equation 

-£#•(<.<,)  =  -P(t.t,)A,  -A,P( t.t,) 

-  P(t.t/)QP(t.t/)  +  (2  9) 

Now,  however  the  final  condition  is  P(tf,tj)  =  0.  For  the 
oase  of  Ftf  -  «  the  modified,  moving  horizon  control  is  given 

by: 

«*(f)  =  -*,-,£f«/,-,(f.f+r)*<0  ‘(2-10) 

and  the  computation  requires  the  inversion  of  P(t,t  +  T)  at 
all  t. 

H  was  proved  by  Kalman  in  [2],  that  the  "steady-state” 
time-varying  feedback  gain  K(t  ,«)  stabilizes,  under  uniform 
complete  controllability  assumptions,  the  time-varying  sys¬ 
tem  (2.1).  Practically,  however  there  is  no  way  to  obtain 
The  moving-horizon  control  laws  are  readily  com¬ 
putable  in  principle,  since  they  require  solutions  of  recur¬ 
sions  over  a  finite  time-span.  Even  better  than  that,  we  can 
derive  efficient  gain-update  algorithms  that  remove  the 
necessity  to  re-solve  for  each  time  point  the  backwards  Ric¬ 
cati  equation,  thus  rendering  the ’method  computationally 
efficient. 


3.  GENERAL  GAIN  UPDATE  RECURSIONS 

Using  some  results  from  the  scattering  theory  origi¬ 
nally  developed  by  Kcdhcffcr  for  the  study  of  transmission¬ 
line  problems,  and  then  applied  to  estimation  and  control 
theory  by  Kailath  and  his  coworkers,  we  derive  recursive 
update  algorithms  for  the  gain  required  in  moving  horizon 
controls.  We  refer  to  [7]-[l0)  for  comprehensive  reviews  of 
scattering  theory.  The  main  idea  of  scattering  theory  is  to 
embed  the  Riccati  variable,  K{ t,o),  that  satisfies  a  forwards 
or  backwards  equation,  into  a  scattering  matrix  Sn.  The 
embedding  is  done  by  defining  the  auxiliary  matrices  ♦(r.c), 
♦(r.e)  and  L(r.e)  through  the  following  differential  equa¬ 
tions: 

»  ♦(t.o)[A,  -  ntRT-*ffrK(r.o)]  (3.1) 


-^T*(r.o)  =  [Ar-  K{r,o)BrRr-xirT)*<j<o)  (3.2) 

-j^-Ur.a)  =  ^{r,a)BrRr~xB  t+(t,o)  (3.3) 

Together  with  the  backwards  Riccati  equation  (2.4).  in  which 
the  final  time  has  been  parametrized, 

-£-*(T.e)  =  A\K(t.o)+K(t.o)At 


-K(r.o)BTJtr-'BMT.o)+Qr 


(3.4) 


and  appropriate  Anal  conditions  at  T  =  o,  these  equations 
yield  the  backwards  evolution  of  a  2»  by  2»  scattering 
matrix,  defined  as 


•  *(v.c)  L(t.c) 

=  /f(r.o)  *(t.«) 


with  S„  s 


/  0 
0  / 


(3.5) 


The  above  chosen  final  values  matrix  does  not  display  the 
correct  starting  point  for  the  g*in  recursion.  Since,  how¬ 
ever,  we  also  have  through  scattering  theory  a  method  to 
change  initial  or  final  conditions  of  Riccati  equations  to  arbi¬ 
trary  values,  we  shall  in  the  sequel  concentrate  on  this  gen¬ 
eric  case  of  F,  =  0.  After  the  gain  is  computed  for  this  par¬ 
ticular  final  condition  the  true  gain  can  readily  be  obtained 
using  a  change  of  final  condition  formula  (see  (3.11)). 

Now,  a  two  parameter  matrix  S„  has  been  defined 
through  a  set  of  backwards  differential  equations  together 
with  a  “canonical”  final  condition,  an  identity  matrix  with 
dimension  2n.  In  the  sequel  we  shall  have  to  exploit  the 
properties  of  this  matrix,  in  particular  its  behavior  as  the 
parameter  o  varies  infinitesimally.  In  this  context  we  have 
the  following  basic  result  due  to  Redhcffer  [7]  (see  also  Reid 
[11]): 

THEOREM  3  .1 

Given  that  the  matrix  functions  involved  in  the 
definition  of  the  linear  system  (2.1)  and  the  cost  function 
(2.2)  are  piecewise  continvos.  the  elements  of  the  above 
defined  scattering  matrix  obey  the  following  forwards 
differential  equations: 


^-♦(t.c)  f  [A,  4  L(t. o)p.]*(r,c) 
^-♦(r.o)  =  +(t.«x)(A,  +  0,£(r.o)] 
—L(t,o)  =  A,L{t.o)  +  L(r.o)A, 


*  L{t,o)Q,L{t.o)  -  B,R,~XB, 
*(T.e)9e*(r,ir) 


(3.6) 

(3.7) 

(3.8) 

(3.9) 


the  initial  condition  for  S„  at  o  =  r  being  the  identity 
matrix. 


The  above  result  together  with  the  backwards  equations 
defining  the  scattering  matrix,  provides  the  evolution  of  5„ 
with  respect  to  variations  in  each  index,  T  and  o.  Now,  realiz¬ 
ing  that  the  moving  horizon  gain  is  obtained  as  a  subtnatrix 
of  S(t,t*T)  (up  to  a  change  to  the  true  final  conditions)  we 
need  to  derive  evolution  equations  for  the  case  of  simultane¬ 
ous  variation  in  the  parameters,  according  to  r  =  l  and 
a  =  t  +  F.  We  have  the  following 

THEOREM  3.2 

The  evolution  equations,  for  the  sliding  window 
scattering  matrix,  are  given  by 

~-H(t.t*T)  =  £-S(T.e)|  T.,.„„r 

♦  ^-5(r,e)!  (3.10) 


* 


The  above  relation,  essentially  the  chain  rule  for 
differentiation,  provides  a  complete  set  of  recur  sic...,  for  the 
(zero  final  condition)  feedback  gain  and  the  auxiliary 
matrices.  These  recursions  arc  a  combination  of  the  evolu¬ 
tion  equations  yielding  the  variations  of  i>jg  with  r  and  a 
separately. 

The  initialization  of  the  recursions  of  type  (3.10)  is  done 
by  first  solving  the  backwards  equations  for  an  initial  inter¬ 
val.  say  (f  c,f  „+ 7').  for  both  the  gain  and  the  auxiliary 
matrices. 

In  order  to  find  the  true  feedback  gain  sequence, 
corresponding  to  the  sequence  of  final  conditions  Ft.  we 
shall  use  the  following  formula,  derived  from  a  basic  closure 
property  of  the  Riccati  equation: 

K(t.t+T.F„r)  =  K(t.t  +  T) 

♦  *(t.t  +  T)F,.r[/-I‘(‘-t  +  T)F,tT]~t*(t.t  +  T)  (3.11) 

where  K(t.o\F,)  denotes,  as  before,  the  gain  obtained 
through  the  backwards  Riccati  equation  with  final  condition 

F 

Note  that  (3. 1 1)  provides  the  true  gain  by  operating  on 
the  entries  of  5<p»r)  and  on  Fur  -  More  about  the  deriva¬ 
tions.  essentially  very  simple  when  relying  on  scattering 
theory,  of  the  above  mentioned  results  can  be  found  in  [6]- 
[10]. 

It  is  interesting  to  note  that  in  the  case  of  Ft-*—  we 
obtain  from  (3.11)  the  following  result: 

K{t  .t  +  T:-)=/f(f  .1  +  .t  *  T)L  '(t  .1  +  T)*(t  .t  +  T)  (3. 12) 

which  expresses  the  gain  P~l(t,t  +  T)  in  terms  of  the  recur¬ 
sively  obtained  sliding-window  scattering  matrix. 

Some  extensions  and  numerical  considerations: 

In  the  case  of  constant  fin'll  weighting  in  the  moving 
horizon  criterion,  we  can  also  derive  recursions  of  the  type 
(3.10)  that  provide  the  true  gain  directly,  avoiding  the  need 
for  a  continuous  final  condition  adjustment.  Also  in  case  of 
given  differential  behavior  of  Ft  the  above  simplification  is 
feasible.  In  ail  these  cases  the  recursions  turn  out  to  have 
the  form  of  (3. 10).  the  modifications  amounting  to  predeter¬ 
mined  changes  in  the  matrices  appearing  in  the  recursions 
with  index  /  +  T.  This  is  of  course  not  unexpected,  since  the 
changes  occur  at  the  right-end  boundary  conditions. 

it  is  also  obvious  that  the  same  embedding  can  be  done 
starting  with  the  P(t.t  +  T)  gain,  which  again  obeys  a  back¬ 
wards  Riccati  equation,  and  this  method  will  provide  direct 
recursions  for  it. 

The  above  derived  algorithms,  although  involving  a 
matrix  twice  the  size  of  the  gain  matrix  needed,  have  the 
potential  of  being  computationally  much  more  efficient  than 
solving  the  backwards  Riccati  recursion  at  each  point  in 
time.  (Also  note  that  we  have  considerable  redundancy  in 
the  scattering  matrix,  since  ♦(t.o)  =  ♦’(t.o)  by  definition.) 
Some  numerical  problems  may  however  arise  in  propagating 
the  sliding-window  differential  equations:  the  updating  algo¬ 
rithms  are  likely  to  cumulate  errors  and  thus  the  gains 
obtained  after  a  large  number  of  iterations  will  differ  from 
the  correct  ones.  The  situation  can  be  ameliorated  by 
implementing  a  restart  procedure  at  adaptively  determined 
intervals.  The  idea  is  to  compute,  at  intervals,  the  "true” 
gain  through  the  usual  forwards  or  backwards  growing- 
memory  update  algorithm  and  to  compare  the  result  with 
the  gain  provided  by  the  sliding-window  propagation.  The 
time  interval  to  the  next  such  chock  procedure  can  then  be 
increased  or  decreased  according  to  a  suitably  defined 
measure  of  the  difference  between  the  "true"  and  pro¬ 
pagated  gain.  (Obviously  the  sliding-window  nlgorilhms  will 
be  propagated  with  the  "true"  values  as  initial  conditions 
between  the  test  intervals,  i.c.  will  be  "restarted".)  The 
above  process  will  clearly  determine  the  period  of  Umc  at 
which  a  restart  is  necessary  in  order  to  keep  the  error  in 
the  gain  within  a  predetermined  bound. 


Thus  moving  horizon  control  design  is  a  computation¬ 
ally  feasible  me l hod  for  stabilizing  time  varying  systems. 
Tha  above  ideas  can  obviously  be  applied  to  discrete-time 
moving  horizon  control  problems,  sec  for  example  refer¬ 
ences  [5]  and  [10]. 

4.  CLOSED-LOOP  STABILITY  RESULTS 

As  pointed  out  earlier,  it  is  a  well-known  result  that  the 
feedback  control  law  with  A'(f .“)  provides  an  asymptotically 
, stable  closed-loop  system,  under  uniform  controllability  and 
observability  assumptions.  It  is  not  obvious  at  -this  point, 
however,  whether  the  sliding  horizon  controls  can  stabilize  a 
general  time-varying  system. 

The  following  simple  example  is  quite  suggestive.  Let  us 

consider  a  scalar,  time-invariunt  system  z  =  ax +6 u.  with 

t+r 

the  modified  receding  horizon  cost  Jn  =  J  (qx*+ru*)dr.  It 

i 

can  be  shown  by  straightforward  algebra  that,  the  closed 
loop  system  is  stable  when  T  >  (2aa)~'ln[(a+l)/(a-l)]  for 
a  >  0.  where  a  =  (l+gbVra*)1'* 

This  example  shows  that  even  without  terminal  weight 
there  exists  a  finite  horizon  that  stabilizes  the  system.  In 
the  general  infinite  horizon  case  the  terminal  weighting 
matrix  plays  no  role  and  can  be  arbitrarily  set  to  zero.  Since 
the  method  we  describe  introduces  a  finite,  sliding  horizon 
concept,  the  weighting  matrix  F,.  which  is  essentially  a 
design  parameter,  plays  a  crucial  role  in  determining  the 
properties  of  the  resulting  control  law.  In  the  sequel  we 
shall  discuss  three  different  choices  for  Ft: 

Case  1:  F,  =  0  (4.1a) 

Case  2:Ft  ~  "  "(4.1b) 

Case  3: 

[Ft:  Ft  >  0.  F,  +A, Ft  +  Ft At- Ft Bt  Rr'B,F,+  Q,  0J  (4.1c) 

'  In  the  above  classification  the  infinite  final  weighting 
can.  of  course  be  considered  as  a  particular  case  or  (4.1c). 
It  has  a  special  value  however,  being  the  most  useful  one  Tor 
practical  design,  as  will  become  clear  later. 

The  existence  of  a  finite,  though  possibly  large  horizon 
for  which  the  control  law  stabilizes  the  system  can  easily  be 
proven  for  the  time-invariant  case.  We  have  the  following 
result. 

THEOREM  4  .1 

If  (As  pair  is  controllable  and  we  have 

F  =  0,9  >  0.R  >  0,  then  there  exists  a  finite  horizon  T.  such, 
that  the  moving  horizon  control  law  (2.6)  stabilizes  the  sys¬ 
tem. 


PROOF:  The  closed  loop  system  is  given  by 


^-*(0  =  [A  -  BR-lffKT]x(t) 

where  A>  is  obtained  from  (2.9).  Let  K  be  the  solution  of  the 
algebraic  Riccati  equation,  i.c  the  steady-slate  feedback 
gain.  Lot  Kj*  -  K  —  Kj.  Then  the  closed-loop  system  can  be 
written  as 

-£-*(l)  =  [A  —  BR~*l?R)z(t)  ♦  BR~xBKf*x{t) 
ett 


Now,  since  A-DR~XB‘K  is  a  stable  matrix,  it  is  sufficient  to 
show  that  the  "perturbation  trrm”  |  \BR~lB  Hr* *  :  /  !  lx  : ! 
can  be  made  arbitrarily  small  for  some  T.  But  it  is  known 
that  Kt<Ktt  for  t ,<f g  and  that  Kt+K  as  F-»«>.  thus 


and  therefore  there  exists  a  finite  T  such  that  the  perturba¬ 


tion  is  arbitrarily  close  to  zero.  This  completes  the  proof.  * 


The  existence  of  a  possibly  very  large  horizon  for  which 
the  modified  control  yields  a  stable  closed-loop  system,  for 
the  general  time-varying  case,  can  be  shown  using  a  similar 
approach.  Intuitively  it  is  clear  that,  for  a  very  large  T,  the 
value  of  K(t.l  +  T;F)  approaches  K(t,<“)  to  an  arbitrary 
degree.  This  provides  a  control  that  differs  very  little  from 
the  steady-state  feedback  law,  which  is  known  to  stabilize 
the  time-varying  system. 

The  case  of  zero  Anal  weighting  leads  thus  to  generally 
large  horizons  and  also  problems  in  determining  a  suitable 
depth  T.  The  case  of  F,  -  "  for  which  we  actually  give  a 
method  for  choosing  the  horizon  depth,  that  turns  out  to  be 
connected  to  the  controllability  properties  of  the  system  to 
be  stabilized,  is  therefore  of  greater  practical  interest.  In 
order  to  slate  the  results  in  this  case  we  recall  the  following 

DEFINITION 

The  pair  is  uniformly  completely  controll¬ 

able  if  for  some  d  >0  the  following  inequalities  hold  for  all  t 

1)  a,/s  W(t.t+6)*a2l  (4.2) 

2)  ||p(t,.t,)||  (*■  3) 

fn  the  above  expressions  <p{r,a)  stands  for  the  state  transi¬ 
tion  matrix  of  A(t ).  a.  "s  are  positive  constants,  y(.)  maps  R 
into  R  and  is  bounded  on  bounded  intervals  and  IK(r,d)  is 
the  controllability  matrix  defined  as: 

•e 

W(tx.t,)  *  f<p(tx,o)BtBmp'(tx.o)da  (4.4) 


We  now  have  the  following  result  on  the  stability  of  mov¬ 
ing  horizon  control  laws  with  Ft  =  ■».  which  was  first  proved 
in  [4]. 

THEOREM  4.2 

If  the  pair  \At.Bt  |  is  uniformly  completely  controllable, 
and  if  On  Q,  <0+1  and  aBInRlnatl  then  for  any  T  >  6  the 
moving  horizon  control  taw  (2.10)  stabilises  the  system 
(2.1). 

Now.  in  addition  to  the  previous  particular  cases  of  final 
weighting  a  somewhat  more  general  class  of  final  weighting 
sequences  will  be  discussed.  The  most  general  case  of  an 
arbitrary  sequence  Fx  is  extremely  difficult  to  handle  due  to 
a  lack  of  known  monotone  properties  for  solutions  of  Riccati 
equations.  Therefore  we  shall  investigate  the  class  of 
sequences  defined  by  (4.1c).  It  may  be  that  with  further 
effort  the  above  results  can  be  extended  to  more  general 
situations,  as  has  been  done  tor  time-invariant  systems  in 
[16]-[17].  for  example. 

In  order  to  prove  the  main  result,  some  properties  of 
Riccati  equations  have  to  be  established  first.  These  are 
summarized  in  the  following  lemma  (see  also  [ll]). 

LEUUA  4  1 

1)  If  the  matrix  F,  belongs  to  the  class  defined  by 
(4.1c).  then  K(t.o).  the  solution  of  the  backward  Riccati 
equation  (2. 4)  satisfies  the  following  inequalities: 

K(t,ox;F0i)  fc  K(r.o2iF,t)  for  T<a,s«i  (4.5) 

2)  If  the  pair  \A,.B,i  is  uniformly  completely  controll¬ 
able  and  for  all  t,  as  I  <  ft  s  a4  /  and  a#/  <  R,  <  a»f  then  for 
any  T  such  that  t  <  T  <  m  there  there  exist  positive  con¬ 
stants  a?  and  at  such  that 

a,/  <  KU.t*T.r„r)  <  a,/  (4.8) 

PROOF:  1)  We  shall  derive  this  result  using  some  mono¬ 
tone  properties  of  Riccati  equations.  Consider  two 


backwards  Riccati  equations  as  follows: 

~dTU <0  *  U(t)At+A,/J(t)-M(t)Blilt-lB,tlHt)+Qt  (4.7a) 

"dt"A,(l>  =  Mt)A'*AtffO)-N(t)B,Rr'B’,N(t)*<J,*  (4.7b) 

with  final  conditions  given  respectively  by 

MUj)  =  Fx  and  N(t,)  =  Ft 

Now  it  is  readily  realized  that  if  Fx  =  and  Q, 1  a:  ft*,  then 
U(t )  »  N(t )  for  all  t  £  tj.  Similarly,  we  can  show  that  pro¬ 
vided  ft*  =  ft*.  Fx>Fg  implies  M(t)3t  N(t)  at  all  t<tf.\ 
These  inequalities  immediately  provide  (4.S)  since  by  choos¬ 
ing  X‘(e,.oa:F’,|)  =  F,t  we  shall  have  A(o,.o*;F,t)  s  F.t  by 

(4.1e)  and  therefore  K(t.ox  F,t)>  K(t,Oz:F0J  is  implied  by 
the  fact  that  K(ol.ox:F.l)=F.l  s  A :(ox.oa;F,J. 

2)  The  result  or  the  second  part  of  this  lemma  may  be 
obtained  by  a  slight  modification  of  the  arguments  given  in 
[2].[12]-[14j  and  the  definition  of  uniform  complete  control-  ] 
lability.  ■ 

Let  us  give  an  interpretation  to  condition  (4.1c).  Note 
that  this  condition  implies  that  the  matrix  Fx  satisfies  the 
following  differential  equation  (of  course  under  the  assump¬ 
tion  that  its  evolution  is  differentiable) 

~dTFt  =  FtAt  +  A*  F'  ~F* Dt  R*  ~Xp'  F‘ +  Q  ^ i  (4  8) 

for  some  matrix  sequence  H,.  Let  K(t,tf  ;Ftj)  be  the  solution 
of  the  backwards  equation  (2.4)  with  Anal  condition  Ft  . 
Then  we  have 

Ft>  K(t  .t,\Ft/)  for  ail  t<t,  (4.9) 

and  thus  we  can  state  that  the  steady-state  solution  for  the 
gain  has  to  be  a  lower  bound  on  the  matrix 

sequences  of  class  (4.1c).  We  are  now  in  a  position  to  prove 
the  following  result 

THEOREM  4  .3 

If  Ft  is  a  matrix  sequence  of  class  (4.1c).  R,  and  ft 
satisfy  the  conditions  of  lemma  4. 1  and  the  pair  (4  ./?»!  i* 
uniformly  completely  controllable,  then  for  any  fixed  T 
such  that  6  <  T  as  «•  ,  the  control  (2.6)  yields  a  closed  loop 
system  that  is  uniformly  asymptotically  stable. 

PROOF:  Let  us  consider  the  adjoint  of  the  closed-loop 
system,  with  the  feedback  gain  given  by  (2.6),  where  the 
final  weighting  sequence  is  of  class  (4.  lc) 

“*.(0  =  -A',  x.(f)  (4.10) 

Here  A*  *  A, -B, R, ''B", Kt  where  Kx  =  K(l,t  +  T,Ft*T). 

Further  consider  an  associated  scalar-valued  function 

P(*..f)  =  x'.(t)AT‘*.(0 
which,  by  part  2)  of  lemma  4.1,  satisfies 

o»l  I*.  1 1**  P(*».t)  s  aiol  I*# 1 1* 

Therefore,  the  above  defined  function  is  a  positive  definite 
function  of  the  adjoint  system  state.  Now  the  asymptotic 
stability  of  the  original  closed-loop  system  Is  guaranteed  if 
the  adjoint  system  state  vector  is  exponentially  increasing 
(i.e.  the  adjoint  system  is  asymptotically  unstable).  Uul  we 
can  show  that 

K*..t  )  =  x'.J  -A,  M,  *20,  Rt'B,  ♦ 

=  x'.l*,  R,-'B-t*Kt-'Qt  K,-'*  ~A~,(f.e;F.)!..„r|z. 

and  using  the  result  part  1).  lemma  4.1.  on  the  monotone 
properties  of  the  gain  w.r.t.  changes  In  the  second  parame¬ 
ter.  we  obtain 

JfK*a.O  >  x\HtRt-'lT,x,  (4.11) 


Now,  let  +*(r.e)  be  the  transition  matrix  of  the  closed-loop 
system  *  =  A<a:.  From  (4. 1 1)  it  follows  that 

2-.  (1  o)[  /«» (f  0. * )«« lit' '  V i *' *  (' 1 o.O<«  1*.  (<  o)  (4.12) 

*  •• 

*  oul|x.(to)i  *  for  t.fc/o+d  (4.13) 

for  some  positive  constant  a,,-  The  inequality  (4.13)  results 
from  the  fact  that  the  closed-loop  system  is  uniformly  con¬ 
trollable  [12].  since  there  exists  some  positive  constant  ata 
such  that 

t*< 

/ 1  !****“« 

< 

But.  (4.12)  implies  that  the  adjoint  system  increases 
exponentially,  which  in  return  guarantees  the  asymptotic 
stability  of  the  original  system  with  the  feedback  under  con¬ 
sideration.  This  completes  the  proof.  • 

The  above  theorem  indicates  that  there  exists  a  whole 
class  of  terminul  weighting  matrix  sequences  that,  with  a 
horizon  even  slightly  greater  than  the  controllability  inter¬ 
val  S.  yield  stabilizing  control  laws.  From  the  above  results 
it  is  also  conjectured  that  large  horizons  are  necessary  if 
the  final  weighting  matrix  is  small,  whereas  for  sequences  of 
large  final  weightings  the  stabilizing  horizon  approaches  the 
controllability  interval  6. 

The  above  results  simplify  somewhat  for  the  case  of 
time-invariant  systems.  The  class  (4.1c).  of  constant  final 
weighting  matrices  is  readily  seen  to  be  the  set  of  final 
weighting  matrices  satisfying  F  >  Km.  the  solution  of  the 
algebraic  Riccati  equation  associated  with  the  control  prob¬ 
lem. 

5.  SOME  FURTHER  TOPICS  AND  RESULTS 

&.l.  Cost  Incurred  by  moving  horizon  laws 

The  moving  horizon  control  dynamically  redefines  the 
performance  criterion  and  therefore  it  clearly  does  not 
optimize  any  overall  criterion.  Since  the  standard  cost  or 
the  type  (2.2)  is  an  accepted  measure  for  the  performance 
or  a  control  law  it  is  useful  to  determine  bounds  on  the  cost 
incurred  by  the  modified  control  laws.  Indeed,  we  can  prove 
the  following 

THEOREMS .1 

The  standard  quadratic  cost  incurred  by  the  eliding 
horizon  control  has  the  following  bound 

fix-  ( t  )ft  X  «  )+u-  (f  )R,  U  (t  )]df 

ssx,(f1)X’(fi.<«+7':/’,|»r)*(ft)  (sl) 

PROOF:  The  quadratic  cost  for  a  linear  feedback  control 
of  the  type  (2.3)  is  given  by  x(f<).'.f(f«.f/ )*((»)•  where 
U  (1  ,tj)  obeys  the  following  backwards  recursion 

=  K,M{l.tj)  +  M{t.t,)b 

at 

*  K(t  ,t  *  T;Ft,T)B>  Hi  ''tfi  *(<  •<  +  T;F, »r)+ ft 
with  the  final  condition  #(</.</)  =  Ft,-  From  (3.10)  we  have 
-~-K(IJ  +  r:F,*r)  •  KiKlt.t*T:F,tT)+KlU  +  7’:/’«*r)Af  + 

I ft 

K(t  .<  ♦  r;fi .  r  )ft  K,  ~xBt  K[t  .t  +  T.F,.T)+Qt-  I  e**r 

Now  define  E{t)  =  .If (I. !,)-*■(«.*  ♦T;Fl,r).  H  obviously 
satisfies  the  following  equation 

-■jfF(t)  -  K,EH)*Elt)Si*  £j*lt.o.F')\'.,*T 


with  boundary  condition  at  lj  given  by  Fit—K(tf  lj  ¥  T;Ft>r). 
Integrating  the  above  equation,  and  taking  into  considera¬ 
tion  the  fact  the  last  term  is  negative  (by  lemma  4.1),  we 
obtain 

*(*.)  <  *,(»/.‘<)[/i/'A'(f/.f/  +  r:/'</.r))$(«/-fi)  (5-2) 

Since  the  closed-loop  system  was  proved  asymptotically 
stable  we  shall  have  Eltt)-*0  as  f/-»*».  Therefore 
M It ,»)  £  K(t ,t  +T;Fn r)  which  establishes  the  desired 
result.  ■ 

Further  work  on  moving  horizon  controls  is  still  needed 
to  see  whether  It  also  inherits  more  of  the  nice  features  of 
“optimal"  feedback  controls,  as  tor  example  robustness  and 
good  sensitivity  properties. 

5.2.  A  structurally  dual  state  estimator 

Since  it  is  well-known  that  there  exists  a  duality 
between  optimal  linear  regulator  problems  and  optimal 
state  estimation,  one  may  ask  what  state  estimation  fitter  is 
the  dual  of  the  moving  horizon  control.  The  answer  is 
indeed  simple.  Structurally,  the  dual  system  is  a  Kalman 
filter  with  a  sliding  window  gain  defined  as  the  solution  of  a 
forward  Riccati  equation  over  a  fixed-length  interval 
(t-T.t).  This  estimator  is  a  suboptimal  state  reconstruc¬ 
tion  fitter  and  can  be  analyzed  by  the  methods  employed 
above  in  order  to  establish  stability  results.  Also,  obviously, 
the  gain  update  equations  can  be  derived  in  a  similar  way. 

An  important  fact  to  realize,  however,  is  that  this  filter 
does  not  provide  the  "best"  estimate  of  the  state  given  the 
observations  over  the  sliding  interval.  It  is  not  a  solution  to 
the  limited  memory  state-space  filtering  problem,  as  posed 
for  example  in  Jazwinski  [15],  since  it  does  not  "completely 
forget"  data  beyond  t-T.  This  can  be  easily  seen  from  the 
fact  that  the  filter  has  infinite  impulse  response.  Scattering 
theory  does  however  provide  a  solution  to  this  problem,  too. 
The  solution  involves  the  application  of  an  idea  similar  to 
the  one  that  led  to  efficient  gain  update  formulas  to  an 
extended  scattering  matrix.  These  results  are  presented  in 
[10). 

6.  CONCLUDING  REMARKS 

A  general  and  computationally  feasible  method  of  sta¬ 
bilizing  time-varying  linear  systems  through  state  feedback 
was  presented.  A  variant  of  the  method,  the  case  of  infinite 
final  weighting,  was  previously  analyzed  in  [4],  and  the  sys¬ 
tem  stabilizing  property  of  the  resulting  control  law  were 
established.  This  paper  further  generalizes  the  moving  hor¬ 
izon  method  to  a  whole  class  of  final  weighting  matrices  and 
also  provides  explicit  gain  update  algorithms  which  render 
the  method  more  efficient  computationally. 

Several  extensions  of  the  method  are  possible.  We  could 
for  example  deal  with  a  time  varying  horlzoh  depth  Tt,  and 
it  is  should  be  clear  that  the  approach  presented  in  section 
3  easily  yields  the  gain  update  algorithms  for  this  case  too. 
One  might  wish  to  change  the  horizon  depth  to  adapt  to 
varying  controllability  properties  of  the  system  under  con¬ 
trol.  The  case  of  discrete  time  systems  can  easily  be 
treated  within  the  same  framework  the  results  being  some¬ 
what  more  involved  algebraically  but  not  conceptually  (see 
(or  example  [5].[10]  and  also  [15]). 
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